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PRELIMINARY AMENDMENT 



Assistant Commissioner for Patents 
Washington, D.C. 20231 



Sir: 



Prior to examination, please amend the above-identified application as follows: 



IN THE ABSTRACT : 



Kindly enter the Abstract submitted herewith on the sheet attached hereto. 



IN THE CLAIMS ; 



Please amend claims 1 - 14 as follows: 



1 . (Amended) A processor architecture adapted to program languages operating 
with a sequential flow of instructions and handling data through use of simple values and 
lists and dynamically allocated arrays, and comprising an instruction holding means, a data 
memory means storing data objects, and execution means, characterized by 
means for handling simple values and references to data objects in dependence of an actual 
instruction from the instruction holding means, said dependence being called a binding; 
means to increment reference counts to a data object and to decrement reference counts to a 
data object in dependence of an actual instruction from the instruction holding means, and 
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in dependence of the means, which handles simple values and references, storing a 
reference to said data object. 

2. (Amended) A processor architecture according to claim 1, characterized by 
means for handling storage of simple data and references to data objects in the means, said 
stored data and references to data objects being referred to, by means of identifiers, from 
instructions /rom the instruction holding means; storage means in the means for handling 
storage of simple data and references to data objects. 

3. (Amended) A processor architecture according to claim 1, characterized in that 
the means for handling the storage of values comprises a parameter memory means having 
means for keeping notice of the bindings to the stored values, and having storage means for 
storing said values. 

4. (Amended) A processor architecture according to claim 3, characterized in that 
said storage means in the means for handling temporary storage of values stores values, 
which comprises at least type information and value information. 

5. (Amended) A processor architecture according to claim 4, characterized in that 
the parameter memory is used to transfer and store function arguments, such that argument 
values are stored in the parameter memory in dependence of an actual instruction from the 
instruction holding means, then bond to certain identifiers in the instructions, and that said 
data values are arranged to be read from the parameter memory in dependence of said 
identifiers in an actual instruction. 

6, (Amended) A processor architecture according to claim 3, characterized by 
means in the parameter memory for storing and managing scope information for the stored 
parameters, where the scope information determines which parameters are currently valid 
and eligible to be read out from the storage. 
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7. (Amended) A processor architecture according to claim 3, characterized by 
means in the parameter memory for storing and managing information for scope and 
values, where the means is used for storing and managing information for scope and data 
values, where the process information determines which scopes and values are currently 
valid and eligible to be read out from the storage. 

8. (Amended) A process architecture according to claim 3, characterized by a 
process identification register for identification of the currently executed process; a scope 
identification register for identification of the currently valid scope. 

9. (Amended) A process architecture according to claim 3, characterized in that at 
least the top of at least one priority queue of processes to be executed is kept available for 
reading, and that at least part of the process descriptor of the next process to be executed is 
kept available for reading. 

10. (Amended) A process architecture according to claim 9, characterized in that 
in order to make a process switch the means for handling values and references: 

creates a new scope and at least the program counter is stored in the 
parameter memory using said new scope; 

stores said new scope value in the process descriptor of the current process, 
said process descriptor may be stored in the data memory; 

restores the scope value for the process switch from the process descriptor 
of said process; 

sets the process switch to be the current process; and 

reads at least the program counter from the parameter memory and performs 

the restoring. 

1 1 . (Amended) A process architecture according to claim 1 , characterized by 
instructions having only one instruction format, where each instruction is composed of a 
distinct number of sub-instructions, each of which has in turn the same and only one format 
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comprising a first part and a second part, the first part determining the action to take and 
the second part providing a value to use in that action. 

12. (Amended) A process architecture according to claim 1, adapted to execution 
of languages using functions and dynamic memory allocation characterized by a set of 
instructions comprising dedicated instructions for making function calls, function returns, 
and parameter transfer between functions. 

13. (Amended) A process architecture according to claim 11, characterized by a 
set of instructions comprising dedicated instructions for incrementing or decrementing the 
number of references to data objects stored in the data memory. 

14. (Amended) A process architecture according to claim 1, characterized in that 
it is adapted to process parts of computer programs written in a functional language. 
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REMARKS 

The claims of the originally-filed application were drafted in accordance with a 
foreign patent practice. The claims are hereby amended merely to present an initial set of 
claims for examination that conform to U.S. patent practice. 



Respectfully submitted. 

Burns, Doane, Swecker & Mathis, L.L.P. 




Kenneth B. Leffler 
Registration No. 36,075 



P.O. Box 1404 

Alexandria, Virginia 22313-1404 
(703) 836-6620 



Date: January 18, 2002 



10/031332 

Attorney's Docket No. n00500-335 
Aftach ment to Preliminary Amendment dated J anuary 18. 2002 
M arked-up claims 1-14 

1. (Amended) A processor architecture adapted to program languages operating 
with a sequential flow of instructions and handling data through use of simple values and 
lists and dynamically allocated arrays, and comprising an instruction holding means [(2,3)], 
a data memory means [(5)] storing data objects, and execution means [(7)], characterized 
by 

means [(4)] for handling simple values and references to data objects in dependence of an 
actual instruction from the instruction holding means [(2,3)], said dependence being called 
a binding; 

means [(6)] to increment reference counts to a data object and to decrement reference 
counts to a data object in dependence of an actual instruction from the instruction holding 
means [(2,3)], and in dependence of the means [(4)], which handles simple values and 
references, storing a reference to said data object. 

2. (Amended) A processor architecture according to claim 1,. characterized by 
means [(4)] for handling storage of simple data and references to data objects in the means 
[(6)], said stored data and references to data objects being referred to, by means of 
identifiers, from instructions from the instruction holding means [(2,3)]; storage means 
[(13)] in the means [(4)] for handling storage of simple data and references to data objects. 

3. (Amended) A processor architecture according to [any of the preceding claims] 
claiml, characterized in that the means [(4)] for handling the storage of values comprises a 
parameter memory means [(4)] having means [(10,11,12)] for keeping notice of the 
bindings to the stored values, and having storage means [(13)] for storing said values. 

4. (Amended) A processor architecture according to claim 3, characterized in that 
said storage means [(13)] in the means [(4)] for handling temporary storage of values stores 
values, which comprises at least type information and value information. 
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M arked-up claims 1-14 

5. (Amended) A processor architecture according to claim 4, characterized in that 
the parameter memory [(4)] is used to transfer and store function arguments, such that 
argument values are stored in the parameter memory [(4)] in dependence of an actual 
instruction from the instruction holding means [(2,3)], then bond to certain identifiers in 
the instructions, and that said data values are arranged to be read from the parameter 
memory [(4)] in dependence of said identifiers in an actual instruction. 

6. (Amended) A processor architecture according to [any of the claims 3 to 5] 
claim 3 , characterized by means [(11)] in the parameter memory [(4)] for storing and 
managing scope information for the stored parameters, where the scope information 
determines which parameters are currently valid and eligible to be read out from the 
storage [(13)]. 

7. (Amended) A processor architecture according to [any of the claims 3 to 6] 
claim 3 . characterized by means [(10-13)] in the parameter memory [(4)] for storing and 
managing information for scope and values, where the means [(10)] is used for storing and 
managing information for scope and data values, where the process information determines 
which scopes and values are currently valid and eligible to be read out from the storage 
[(13)]. 

8. (Amended) A process architecture according to [any of the claims 3 to 5] 
claim 3 , characterized by a process identification register [(PROCESS 1; PROCESS 2)] for 
identification of the currently executed process; a scope identification register [(ENV 1, 
ENV 2)] for identification of the currently valid scope [(fig. 4)]. 

9. (Amended) A process architecture according to [any of the claims 3 to 8] 
claim 3 , characterized in that at least the top of at least one priority queue of processes to 
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be executed [are] is kept available for reading, and that at least part of the process 
descriptor of the next process to be executed [are] is kept available for reading. 

10. (Amended) A process architecture according to claim 9^ characterized in that 
in order to make a process switch the means [(4,5,6)] for handling values and references: 

[*] creates a new scope and at least the program counter is stored in the 
parameter memory [(4)] using said new scope; 

[*] stores said new scope value in the process descriptor of the current process, 
said process descriptor may be stored in the data memory [(5)]; 

[*] restores the scope value for the process switch from the process descriptor 
of said process; 

[*] sets the process switch to be the current process; and 

[*] reads at least the program counter from the parameter memory [(4)] and 
performs the restoring. 

1 1 . (Amended) A process architecture according to [any of the preceding claims] 
claim 1 . characterized by instructions having only one instruction format, where each 
instruction is composed of a distinct number of sub-instructions, each of which has in turn 
the same and only one format comprising a first part and a second part, the first part deter- 
mining the action to take and the second part providing a value to use in that action. 

12. (Amended) A process architecture according to [any of the preceding claims] 
claim 1, adapted to execution of languages using fimctions and dynamic memory allocation 
characterized by a set of instructions comprising dedicated instructions for making function 
calls, function returns, and parameter transfer between functions. 
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Marked-up claims 1-14 

13. (Amended) A process architecture according to claim 11, characterized by a 
set of instructions comprising dedicated instructions for incrementing or decrementing the 
number of references to data objects stored in the data memory [(5)] . 

14. (Amended) A process architecture according to [any of the preceding claims] 
claim 1 . characterized in that it is adapted to process parts of computer programs written in 
a functional language. 
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ABSTRACT 



A processor architecture is adapted to program languages operating with a 
sequential instruction flow and handling data through use of lists or tuples or simple types. 
It comprises a program holding means, an instruction holding means, a data memory 
means storing data objects, and an execution means. Means are provided for handling 
references to data objects referenced by bindings and comprising means to increment 
reference counts to a data object and to decrement reference counts to a data object in 
dependence of an actual instruction from the instruction holding means. 
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A processor architecture 

This invention relates to a processor architecture of the kind disclosed ia the pream- 
ble of claim L The processor is particularly, but not exclusively, adapted to execu- 
tion of functional programs. 

TECHNICAL FIELD OF THE INVENTION 

Program development is very costiy. These costs could sometimes be reduced if the 
program could be made in a functional language since it is hard for a user to use a 
machine programmed in a conventional language - it requires a lot of knowledge - and 
because of the complexity a programmer might introduce hidden errors. 

The development of prograrm nmp; languages 

The development of the first electronic con^uter started the development of several 
programmitig languages suited for this type of computer, such as FORTRAN, COBOL, 
Algol, BASIC, Pascal. These languages have been called imperative languages, below 
also called conventional languages, mainly because of the fact tiiat they normally give 
programs that consist of a sequence of commands or instractions to be executed 
sequentially by a conventional computer, Le. a computer designed according to the 
principles developed by John von Neumann. Imperative kind of programs have became 
uicreasingly complex and often contain a lot of errors, are dfficult to read, difficult to 
understand and particularly hard to modify. 

An increasing discomfort with these imperative languages led to the development of 
another series of languages, so called functional languages: LISP, ISWIM, Scheme (a 
dialect of LISP), ML, Hope, SASL, and so on. The driving force behind the 
development of Ihese languages was conceptual simplicity; no particular machine 
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influenced the design. Functional progranmiing languages have several properties 
alleviating some of the disadvantages of the more conventional progranmiing languages. 

For additional information and understanding we refer to the textbook "Functional 
Programming Using Standard ML", Ake Wikstrom, Prentice Hall 1987. 

A program written in a functional language can be seen as a set of deJBnitions of 
properties of objects and as computation rules. The definitions are the declarative part 
and the computation rules are the operational part that the computer uses during 
executioiL Functional languages provide a higher-level inter&ce to the computer, which 
makes it possible for the programmer to abstract away from hardware-related details of 
the computer. As a positive side-effect, functional programs are often shorter and easier 
to understand than conventional imperative programs. 

Nowadays functional languages are implemented as a virtual machine on a conventional 
processor. This has been done with compilers or inteipretating programs. This means 
that a program is executed which interprets the program instractions. Every program 
instruction results in that a number of machine instructions are executed. The execution 
will therefore be slow. 

It is clear that some of the benefits of the functional program approach have been held 
back by the fact that practically no useful dedicated hardware has been on the market for 
the process of storing and executing functional programs in an effective manner. Some 
processors adapted to execute functional programming languages, so called FFP 
machines, are discussed in the textbook "High-level Language Computer Architecture" 
(ISBN 0,88175-1342-4) from 1988, chapters 11 and 12, and have been constructed and 
manufactured from time to time. Dedicated Lisp working stations were for instance 
furnished to the middle of the eighties, A transputer from Inmos Ltd, which is a 
processor for built-in systems was introduced in the middle of the eighties and is still on 
the market 
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A particular kind of functional language is Erlang, which is developed by the Applicant 
for real-time applications. In spite of the disadvantage that functional programs are slow 
to execute on the von Neuman kind of computer, much slower than the imperative 
languages, the use of Erlang is increasing. However, the capacity of the conq^uters 
5 having Erlang installed is not enough for some ^plications. Eriang programs are built 
as a nimiber of communicating processes, also called processes. A switch is often made 
between execution between different processes. This switch is unproductive and should 
be made as fast as possible. For the moment being this is made in software, in a so 
called *'run-time system". 

10 

Since data programs written in functional languages, such as Erlang, provided on 
conventional computers are very power consumiag, too much for a lot of applications, 
there is a need for providrag a hardware having a low power dissipation. Low power 
dissipation is essential in most products. Demand for low power dissipation wiU be 
15 increasing in the future. High power dissipation hinders further processor speed 
improvement 

SUMMARY 

20 OBJECTS OF THE INVENTION 

An object of the invention is to provide a micro processor which is adapted to execute 
programs or at least parts of programs written in a language having a sequential 
instruction flow, for iostance in Erlang. 

25 

Another object of the invention is to provide a co-processor executing a functional 
language in a real-time operative system. One or several co-processors of this kind 
should be able to handle processes demanding great capacity. 



30 



Still another object of the invention is to provide a processor, particularly dedicated to 
Erlang. 
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Yet another object of the invention is to provide a processor adapted to a functional 
language in wireless portable equipment Processors in wireless portable equipment are 
provided with a content of software (SW) which often need be changed In such a case a 
5 processor adapted to handle a fimctional language, such as Erlang, should give a great 
advantage. 

Still another object of the invention is to provide a processor adapted to a fimctional 
language having fewer memoiy accesses than what is conomon today. This means a 
10 need for a high instruction density. 

Another object of the invention is to provide a device able to handle much functionaUty 
implemented in software and which has a low or reasonable power dissipation, thus to 
provide a great amount of ftmctionalily to low power consunq)tioiL 

15 

Yet another object of the invention is to provide dedicated hardware support for garbage 
collection and process switching in the processor in order to improve execution 
performance and lower power dissipation. Such mechanisms are conamonly 
implemented in a "run time system" in software. 

20 

INVENTION 

The invention relates to a processor architecture adapted to program languages op- 
25 erating with a sequential instruction flow and handling data through use of Usts or 
tuples or simple types, and comprising an instruction holding means, a data memory 
means storing data objects, and execution means. The objects mentioned above can 
be solved by providing means for handling references to data objects referenced by 
bindings and comprising means to increment reference counts to a data object and to 
30 decrement reference counts to a data object in dependence of an actual instruction 
ftom the instruction holding means. 
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Storage means could be provided in the means for handling temporary storage of 
data, and to keep notice of bindings to said temporary storage. When the notice 
keeping means detects generation of a zero reference to an object, meaning that this 
object is not needed anymore, it preferably makes the memory slot for that object 
available as a free memory slot The means for handling temporary storage of data 
comprises preferably a parameter memory means having means for keeping notice 
of the bindings to the stored values, and haviag means for storage of said data val- 
ues. 

Value storage means could store values and type information for the values fed to 
the parameter memory. The values, being a part of the data objects, have then the 
bindings to the parameters. The parameter memory will then transfer values be- 
tween functions using the parameters and using the parameters for temporal storage. 
The parameter memory coidd replace parameta references in fetched instractions 
from the instruction memory means with stored actual values before computation. 
Means could be provided in the parameter memory for storing and managing envi- 
ronment information for the parameters, where the environment determines which 
parameters are currently valid parameters. Means could be provided in the parame- 
ter memory for storing and managing information for parameters and environments, 
where process information fed to the parameter memory is used to deteimine which 
environments and which parameters are currently valid. 

A process identification register could be provided for identification of the currently 
executed process, and an environment identification register for identification of the 
currently executed environment. At least the top of at least one priority queue of 
processes to be executed is preferably kept available for reading. At least part of the 
process descriptor of the next to be executed process is then kept available for 
reading in the parameter memory means. In order to make a process switch: 
* a new environment is created in the parameter memory, and at least 
the program counter is stored in said new environment. 
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* said new environment value is stored in the process descriptor of the 
current process, said process descriptor may be stored in the data 
memory, 

* the enviromnent value of tilie new process is restored, 

* the new process is set to be the current process, 

* at least the program counter is restored. 

The instructions are preferably provided with only one instruction format, where 
each instruction is composed of a distinct nimiber of sub-instructions. Each sub- 
) instruction has in tum the same and only one instruction format comprising a first 
part and a second part, the first part determining the action to take and the second 
part providing a value to use in the action. 

The invention could be adapted to execution of functional languages. Then a set of 
5 instructions are created comprising dedicated instructions for function calls, func- 
tion retums, parameter transfer between functions. Then also a set of instructions 
could be created comprising dedicated instructions for incrementing and decre- 
menting of memory references. The processor architecture disclosed above could be 
adapted to process parts of computer prognms written in a functional language. 

0 

Thus, the invention relates to a processor architecture adapted to program languages 
operating with a sequential flow of instructions and handling data through use of 
simple values and lists and dynamically allocated arrays, and comprising an instruc- 
tion holding means, a data memory means storing data objects, and execution 
25 means. The invention is essentially characterized bymeans handling simple values 
and references to data objects in dependence of an actual instruction from the in- 
stmction holding means, said dependence called a binding. It has also means to in- 
crement reference counts to a data object and to decrement reference counts to a 
data object in dependence of an actual instruction from the instruction means, and in 
30 dependence of the means storing a reference to said data object. The invention could 
comprise means for handling storage of simple data and references to data objects in 
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tiie means, said stored data and references to data objects referred to, by means of 
identifiers, from iastructions from the instruction means. Storage means could be 
provided in the means for handling storage of simple data and references to data 
objects. The means for storage of values could comprise a parameter memory means 
having means for keeping notice of the bindings to the stored values, and having 
storage means for storage of said data values. The storage means could be provided 
in the means for handling temporary storage of values, where the stored values 
comprise at least type information and numeric information, and the type informa- 
tion characterizes the numeric information. 

The parameter memory could be used for transfer o^ and storage of, function argu- 
ments such that argument values are stored in the parameter memory in dependence 
of an actual instruction from the instruction means, then bond to certain identifiers 
in the instructions. The values could be read from the parameter memory in depend- 
ence of said identifiers in an actual instruction- Means could be provided in the pa- 
rameter memory for storage and managing scope information, denoted environment, 
for the stored values, where the scope determines which values are currently valid 
and eligible to be read out from the storage. Means could be provided in the pa- 
rameter memory for storage and managing information for scope and values, where 
the means is used for storing and managing process information, and the process in- 
formation determines which scopes and values are currently valid and eligible to be 
read out from the storage. A process identification register could be provided for 
identification of the currentiy executing process. A scope identification register 
could also be provided for identification of the currentiy valid scope. At least the 
top of at least one priority queue of processes to be executed could be kept available 
for reading. At least part of the process descriptor of the next to be executed process 
coidd be kept available for reading. In order to make a process switch preferably: 
* a new scope is created and at least tiie program counter is stored in the parameter 
memory using said new scope; 
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* the new scope value is stored in the process descriptor of die cirrrent process, said 
process description may be stored in die data memory; 

* the value of the scope for the switched to process is restored from the process de- 
scriptor of said process; 

* the switched to process is set to be the current process; 

* at least the program counter is read from the parameter memory and is restored. 

The invention relates also to a process architecture, which is characterised by in- 
structions having only one instruction format, where each instruction is composed 
of a distinct number of sub-instractLons, each of which has in turn the same and 
only one format comprising a first part and a second part, the first part determining 
the action to take and Ihe second part providing a value to use in that action. The 
process architecture is preferably adapted to execution of languages using functions 
and dynamic memory allocation. It could then be characterised by an instniction set 
comprising 

dedicated instructions for making function calls and transfer function argmnCTits in 
the same instruction; 

dedicated instructions for making return from function and transfer function result 
in the same instruction. 

A set of instructions could comprise dedicated sub-instructions for incrementing or 
decrementing the number of references to data objects stored in die data memory, 
where said sub-instractions specify identifiers representing bindings to the parame- 
ter memory. The parameter memory could in turn store memoiy references to the 
actual data objects referred to by said bindings. The managing of the storage utilisa- 
tion of the parameter memory is an autonomous process, which is able to read out 
values from the parameter memory and store these values in the data memory. It 
could then make storage positions in tiie parameter memory free to use, and be able 
to read back such values from the data memory and store these values back m the 
parameter memory and restore the information in for said values. When die number 
of processes having values in the parameter memory is larger than a specified limit 
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values belonging to a chosen process could be moved to the data memory. When the 
number of scopes for a process in the parameter memory is larger than a specified 
limit values belonging to a chosen scope could be moved to the data memory. When 
the next to be executed process not having values in the parameter memory, but has 

5 such values moved to the data memory, at least some of these values could be 
moved from the data memory and restored in the parameter memory. When the 
number of scopes in the paarameter memory for the current process goes below a 
specified limit, there are values for the current process moved to the data memory, 
at least some of these values could moved from the data memory and restored in the 

1 0 parameter memory. 

ADVANTAGES 

The processor architecture is designed to perform particularly well for the following 
15 cases: 

• "Functional application" which constitutes tiie controlling structure in a frmctional 
language. 

• Process switching, below called context switching (Other egressions for processes 
are tasks or threads. The expression process(es) will be used in the remaining of the 

20 text, although it should be understood that tasks or threads could be used as well). 
Normally, Telecom applications are implemented using a large number of proc- 
esses. It is essential to be able to switch between these processes fast, in order not to 
loose performance. 

• Message passing between processes. 

25 • Memory management, including garbage collection. 

Functional languages leave memory management to the nm-time system. Handling 
memory takes time from the processor and is generally regarded as a problem. 
When using the invention provided with for instance Erlang, this is done in the 
30 processor by a dedicated unit and interfering very little with the useful computing. 
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The processor according to the invention executes the instructions in a RISC like 
manner, i.e. in a sequential instruction flow easy to pipeline. Thus, the processor 
according to the invention is not a reduction machine. It has a simple combinatoiy 
instruction decoding, efficient execution for function calls, siqpport for creation and 
manipulation of lists and tuples, support for fast context switching, and high code 
density. This keeps the power dissipation on a low and economical level avoiding 
the need for costly cooling systems. 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete imderstanding of the present invention and for further objects 
and advantages thereof^ reference is now made to the foUoAving description taken in 
conjunction with the accompanying drawings, in which: 

FIG 1 illustrates a first embodiment of a dedicated processor architecture for 
handling functional languages working with a sequential iustniction flow, such as 
Erlang. 

FIG 2 illustrates a preferred embodiment of an instruction pipeline. 

FIG 3 illustrates an embodiment of the internal structure of the parameter 
memory included in the processor according to the invention. 

FIG 4 illustrates the hierarchy in the specification of the parameters stored in 
the parameter memory. 
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DETAILED DESCRIPTION OF EMBODIMENT 

The instruction set architecture (ISA) in the processor according to the invention is 
designed to be compact and efficient for the execution of functional languages and 
particularly for Erlang. However, the inventive concept is also adaptable to some 
other kinds of languages than functional. Some design goals for the ISA are identi- 
fied below: 

Execution in a RISC like manner, i.e. a sequential instruction flow easy to pipeline. 
Simple instruction decoding. 
Efficient execution for function calls. 
Support of fast context switching. 
High code density. 

Supporting garbage collection for lists ain tuples. 

There is preferably only one instruction format. Every instraction is preferably 
composed of a distinct number of sub-instructions, for example three, each having 
the same format. The first sub-instruction format comprises a first part, which de- 
termines the action to take, and a second part, which provides a value used in the 
action taken. The fibrst sub-instruction may determine the interpretation of the full 
instmction, or may allow other sub-instructions to instruct concurrent independent 
actions in a Very Long Instmction Word (VLIW) manner. 

Reference is now made to FIG 1, which illustrates an embodiment of the main ar- 
chitecture of the processor according to the invention, which fulfils the features 
mentioned above. The program for the processor is written into an instruction mem- 
ory 2. The program is stepped forwards by a program counter 1 providing an ad- 
dress to the instruction memory 2. 
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As illustrated in FIG 2, the processor architecture is in this embodiment basically a 
four stage pipeline, comprising the stages Fetch IF, Instruction Decode ID, Parame- 
ter Memory Access PMA and Execute EX, as will be further described below. Since 
stores can be pipelined a fifth store stage also could be regarded as a part of the 
pipeline. 

Returning to FIG 1, the instruction to be fetched is pointed out by the program 
counter 1. It is fetched from the instruction memory 2 by the instruction fetch 
mechanism IF. Therefore, the program coimter is connected to the instruction mem- 
ory 2 and feeds an address to the memory. The memory then returns the instruction 
pointed to by the address. 

The fetched instruction is stored in the memory 2 in coded form, i.e. in a form com- 
piled into the machine code of the processor, and is then fed to an instruction de- 
coder 3, which makes the Instruction Decode ID, Unconditional branches are de- 
tected during the decoding stage. The decoder 3 transfers the instruction into a 
number of control signals functioning as control words. 

The decoded instruction is then transferred into a parameter memory 4 stage, for the 
PaiametCT Memory Access PMA. This parameter memory 4 is a hardware means of 
a particular kind provided for this invention. Within this stage parameter references 
in tiie decoded instruction will be substituted by their corresponding actual values 
which are stored in the parameter memory 4, as will be apparent below in relation to 
FIG 3. 

Thus the processor does not operate with registers or a stack. It is instead operating 
with parameters. The parameters are temporary bindings to values, which are tem- 
porarily stored in the parameter memory 4 in the processor. Parameters are allocated 
according to need aad are freed upon completion of function application or upon 
e^Ucit instruction to be. Parameters are accessed according to context The maxi- 
mum number of parameters is not limited by the architecture but depends on the 
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Specialised processor implementation. This gives a good and efficient support for 
handling of function arguments and local variables/bindings. 

Parameters thus take a central part in the instruction set architecture of the processor 
5 according to the invention, as Avill be illustrated below. Parameters are used for ar- 
guments of functions and local bindings made within a function body, and for ar- 
gument transfer at function calls. 

The parameter memory 4 is designed to provide a fast instruction execution and 
10 stores current bindings, i.e. function parameters and local variables. This means that 
parameter transfer, local bindings and function results do not go through the main 
data memory 5 and instead through the fast parameter memory 4. 

Within this parameter memory stage certain decoded instructions will create new 
15 parameter bindings by storing actual values in the parameter memory 4 and creating 
parameter bindings to these values. Stored values may be provided from the de- 
coded instruction or may be provided from a register, or may be fetched from a data 
memory 5. Several operations of either kind of store and substitute may be done 
during each clock cycle. 

20 

During the parameter memory access stage parameter bindings in an instruction are 
replaced with actual values. The parameter memory 4 could preferably support at 
least two such replacements per clock cycle. After the parameter memory stage pa- 
rameter bindings in an instruction have been substituted with actual values where 
25 the actual values are fetched from the parameter memory. 

The data memory handler 6 keeps record of the addresses in the data memory 5, be- 
cause in functional languages, such as Erlang, the program is not working with ad- 
dresses and pointers as in imperative languages. Such features are hidden in a so 
30 called run-time procedure which means tiiat the data memory handler 6 is needed to 
detertnine and keep record of the addresses for the particular registers comprising 
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the data for the processes stored in the data memory 5. Addresses for stored data is 
delivered back to the execution unit 7. 

The parameter memory 4 feeds substituted iastruction data and code words to the 
execution unit 7, which executes the instruction. The parameter memory 4 com- 
prises also a register into which at least the current identity, id, of a process is 
stored, as will be discussed further when describing FIG 3. 

The parameter memory 4 is in fact the unit, which reads the registers in the data 
memory 5, while an execution unit 7 connected to the parameter memory 4 is the 
unit, which writes the data into the registers. The data memory handler 6 is con- 
nected to co-operate with both the parameter memory 4 and the execution unit 7. 

Similarly to the language Erlang the processor architecture is based upon processes. 
A first dedicated register in the parameter memory holds the current process id, cpr, 
A second dedicated register holds the current environment identity, env. An envi- 
ronment means a sequence of iastruction within which certain parameter references 
are valid, for instance the scope of the value bindings. 

The preferred stmcture of the parameter memory 4 is presented in FIG 3. It com- 
prises a process storage plane 10 in which the current process, process_info, is 
stored, an environment storage plane 11 in which the information of the environ- 
ments, environment_info, for the process is stored. 

While not directly visible in the instruction set, processes are used to keep track of 
processes (or tasks or threads) in the high level language executed on the processor. 
There is a correspondence between an Erlang process and a process in the proces- 
sor. A spawn instruction creates a new process, initiates its environment and associ- 
ates it with its process descriptor. Another instruction pushes a process on a process 
queue. Another instruction switches out the current process and switches in the first 
process in the process queue to become the new current process. 
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At least the following information is stored in the process descriptor: 

Pointer to first message. 

Pointer to last message. 

Current environment (when not executing). 

Other misc information, such as a list of linked processes. 

Also, the architecture is based upon the concept of environments. An environment 
(env) defines the current bindings in the parameter memory 4. Environments are 
used to keep track of parameter scope. A new environment, env, is created at the 
beginning of a function call and becomes the current environment. The parameter 
bonds in a function call are bonded in the new environment. At a function return the 
current environment is terminated, and the environment of the calling function is re- 
stored as the current environment, i.e. the current enviromnent is replaced with the 
previous environment All parameter bindings ia the replaced environment are 
purged at function return. 

As illustrated in FIG. 4, a parameter is valid only within its environment. Similarly 
an environment is valid only within its process. 

Thus, when an instmction is provided which makes a function call then env is 
stepped up with 1. When an instruction comes, which indicates a jump back then 
env is stepped down with 1, 

The parameter memory 4 also has a storage plane 12, for instance comprising a 
register crp, holding process identity for a certain parameter value and type stored in 
plane 13. As illustrated in FIG 3, searching for the parameter id, actual environment, 
ewv, and actual process is transmitted to the value storage 13. 
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The parameter memory 4 can perform a mmiber of actions. Examples of these ac- 
tions and their corresponding results and required inputs axe hsted below. 

Read: The parameter memory 4 returns the type and value, stored in the value 
5 storage plane 13, of the specified parameter, information of the parameter id, actual 
environment and actual process is provided to the storage planes 10, 11 and 12 in 
the parameter memory 4. 

Pop: The parameter memory 4 returns the value + type (stored in the storage 
10 plane 13) of the specified parameter. In addition it eliminates the parameter and its 
value from the parameter memory 4, Information of the parameter id, actual envi- 
roimxent and actual process is provided to the storage planes 10, 1 1, and 12 in the 
parameter memory 4. 

15 Set: The parameter memory 4 stores a new parameter with a specified id (in the 
type and value storage plane 13). Information of the parametet id, actual environ- 
ment, actual process and parameter value is provided to the planes 12, 11, and 10 in 
the parameter memory 4. 

20 Garb: The parameter memory eliminates the parameter and its value from the 
planes 10-13 in the memory. Information of the parameter id, actual envirorunent 
and actual process is provided to the parameter memory 4. 

Garb env: The Parameter Memory eliminates all parameters and their values from 
25 the planes 10-13 for the specified envirormient value, information of the environ- 
ment and the actual process is provided to the parameter memory 4. 

A read operation could, for example, work in the following way. The three leftmost 
storage planes 10, 11, 12 in FIG 3 perform an associative search. The line yielding 
30 hits in the three storage planes selects the value + type data stored for that line in the 
rightmost storage plane 13 in FIG 3. 
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The function is similar for storing, except that value + type data is stored instead of 
being read The garb and pop operations discard the information in the addressed 
position and make it free to use. 

5 

Thus, the parameter memory 4 is used to store values, and type information for 
these values. Parameters are bound to the values where the parameters are used to 
transfer values between functions and are used for temporal storage. The parameter 
memory 4 can replace parameter references in fetched instructions with stored ac- 
10 tual values before computation. 

Search in tiie files could be made in some kind of associative process. However, the 
search could be implemented in some other way as well, for example to use com- 
promised addresses by for example process and env. It is also possible to connect 
15 the process storage to an associative memory and the environment storage 11, Then, 
the parameter reference is fed to the parameter memory and makes a search. A value 
is out-putted. 

The parameter memory 4 can also manage environment information for the pa- 
20 rameters where the environment determines which ones of the stored parameters, 
which are the currently valid parameters* 

The parameter memory 4 could as well manage process information for the pa- 
rameters and environments where the process information determiues which ones of 
25 the stored environments, which are currentiy valid environments, and which of the 
stored parameters, which are currentiy valid parameters. 



30 



This means that parameters are used instead of registers or stack. This leads to more 
efiQcient handling of function calls and local bindiugs. A context dependent mecha- 
nism is used for addressing the parameters. This makes the iastruction set architec- 
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ture here described independent of the amount of parameter storage in a partictilar 
processor implementation. 

At least one process queue is administrated by the processor. Several queues may be 
administrated, for instance for different priority levels. 

Since the Parameter Memory 4 has a limited number of parameter slots it may not 
be sufficient to hold all parametere for a large program at the same time. 

If Ihe Parameter Memory 4 begins to reach its capacity limit, parameters begin to be 
stored in the data memory 5. thus freeing slots in the Parameter Memory 4. This ac- 
tion is called swap out of parameters. The opposite action, called parameter swap in, 
reads previously stored parameters from the data memory 5 and restores them in the 
Parameter Memory 4 with correct process, environment and parameter id values. 

Both parameter swap in and swap out are done automatically by the processor with- 
out interfering tiie instruction execution. 

The top of the process queue(s) is observed. If a process (PF) is found there which 
is not currently in the Parameter Memory 4, parameters for a process (which is not 
the cmrent process) in the Parameter Memory 4 become swapped out, and parame- 
ters in the topmost environments of the found process (PF) are swapped in to the 
Parameter Memory. 

A certain mechanism determines which process, which should have it s parameters 
swapped out. For instance. Least Recently Used (LRU) could be the strategy by 
which it is decided which process to swap out. 

If for the current process the number of environments in the Parameter Memory be- 
gins to reach the maxunum limit, an activity is started to swap out parameters of the 
lowest environment to the data memory. Similarly, if the number of environments in 



wo 01/06355 



19 



PCT/SEOO/01494 



the Parameter Memory begins to reach the lower limit, an activity is started to swap 
in parameters of the highest environment stored in the data memory (if any). 

The parameter swap in and swap out mechanisms ensures that the most likely to be 
used parameters are stored in the Parameter Memory 4, 

Regarding the memory management, the memory slots in the data memory 5 not 
used are organised as a list of available memory registers. A determined register 
(free) in the data memory handler 6 points to the first element of that list. Upon a 
request for storage of an element, this element is stored in the memory slot denoted 
by the mentioned dedicated register (firee). The register (free) is then updated to 
point to the now actual first available memory position, below called memory slot, 
which earUer was the second in the list Upon releasing a used memory slot this slot 
is placed in the beginiung of the list of available memory slots. 

As mentioned above Erlang handles data through the use of tuples and Usts and 
simple types. These data are referenced by bindings. When a data object is not ref- 
erenced by a binding, it is not needed any more and can be thrown away. Its mem- 
ory slot can then be used for other data objects. Thus, memory slots to which no 
reference is made are considered free to use. This means that an automatic garbage 
collection could be regarded as being made. Notice is kept of such memory slots, 
and they are released upon request. 

Memory management is thus supported well in tiiis architecture and is performed 
according to the principle of reference counting. This means that some sub- 
instructions control incremented reference and decremented reference, respectively, 
for data objects. When such a sub-instruction is executed the reference part of a data 
object is either incremented or decremented automatically by the imit for memory 
management If a zero reference occurs and is detected its memory slot is made free 
and available as free memory slot for usage of other data objects. 
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As mentioned above, the architecture of the parameter memory 4 is based on proc- 
esses and there will often be a need to switch from one process to another process. 
Then the following operations need be done. 

5 Upon a context switch the cmrent enidronment identity is stored in a Process (or 
Task) Descriptor pointed out by the process register (cpr). Other registers may be 
stored in the parameter memory if needed. When the process is switched back the 
register values can then be restored from the parameter memory. Thus, the only ac- 
cess to tile data memory 5 needed to accomplish a context switch is to store and re- 

10 store the environment 

In order to make a switch: 

♦ Bind data values of registers in the current working set to parameters and store them 
in the parameter memory 4. The value of the current enviromnent is stored in the 

15 process descriptor of the current process. Env then fimctions as a key for pointing to 
where in the parameter memory 4 a reading should begin at restore. 

• Read env and restore into the parameter memory 4 the new "working set" of the 
new process to which the switch is made from the process descriptor in the data 
memory 5 comprising the environment for that new process. 

20 • Restore the program coimter 1 to the value next after it was before the actual 
switched process were switched out. 

Following this scheme time consimiing memory access to the main memory 5 is 
itiiriiiiused, and fast context switching is achieved. 

25 

The instruction set architecture could for example be intended for execution of 
ftmctional languages, supporting the languages witii dedicated instructions for fimc- 
tion calls, function returns, transfer of arguments between fimctions, and process 
management. There are particular instructions for creating, reading, writing and ma- 
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nipulation of the data objects, which occur in ErlaQg, i.e. lists and tuples and simple 
types. 

Examples of instruction format 

instruction :: = sub-imtruction, sub-instruction, sub-instruction 
sub-instruction :: = tag, value 

where tag specifies how value should be interpreted. For example: par, 5 means that 
parameter number J is specified. This presents the logical organisation of the in- 
structions. In an actual processor implementation all tags can for example be 
grouped together. The tag in the first sub-instruction typically specifies how the full 
instruction shall be interpreted. If for example tag in the first sub-instraction speci- 
fies a binary arithmetic operation, the two other sub-instruction specify the oper- 
ands, and the value of the first sub-instruction specifies the type of arithmetic op- 
eration. 

There can be exceptions firom this rale. Some of the instructions in the first memory 
slot do only use the first two sub-instructions in which case the third slot can be 
utilised for a single sub-instruction, i.e. there could be a group of independent sub- 
instructions which only operates in the third slot in an instruction. Some independ- 
ent sub-instructions can be placed in the second slot as well. 

Examples of tag/value combinations are: 

Tag, value :: =fiin,funjtumber (call fimction junjtumbef) 

I par, parjmmber (use the specified parameter) 

I pop, register j^umber (return back from current function, deliver result in the 
specified register) 

I reg, registerjtumber (use the value in register jtumber) 
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I alu, aluj)peration (perform the operation given by 

alujDperaiiofi) 

I gar, parjaumber (free the space used by this parameter) 

I etc. 

Example 

The following registers could be used in the processor: 

stores the result from alu operations, 
stores the current process id, 
stores current environment, 

stores the memory address of the latest stored element, 
holds a result value when returning from a function, 
holds a data element Its parts can be addressed by dOl. d02, d03. d04, 
where dOl holds the element type value, d02 holds a numerical value, 
d03 and d04 holds pointer/integer values including value type, 
stores the &st free memory slot. 

The processor could use following not addressable registers: 

enV'Old: stores the previous environment 

cnt: stores the last allocated parameter niunber. 

As mentioned above, parameters are used for argument transfer at function calls, 
and for local bindings. When a function call is executed the arguments are listed in 
the second and third sub-instmctions in the calling instructions. The arguments get 
consecutive numbers as they appear starting the first sub-instruction iq a new func- 
tion body. 

A local binding is made through the par or pad instructions mentioned below, e.g. 



acc: 

cpr, 

env: 

id: 

res: 

dO: 

free: 
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(pab:5, reg:acc, * *) 

binds parameter 5 to the value in the register acc in the current environment in the 
current process. * represent arbitrary element 

There are also particular instructions for manipulating the references to data, i.e. to 
support the language Avith dedicated instructions to increment or decrement the 
number of memory references to a data object 

Experiences with this ISA (Instruction Set Architecture) has indicated a consider- 
able reduction of instructions in order to make a particular functionality. This means 
correspondingly fewer numbers of memory accesses. Memory accesses demand 
much power. A reduction of them will therefore lower the power consimiption in 
the system. 

Although the invention is described with respect to exemplary embodiments it 
should be understood that modifications can be made without departing from the 
scope thereof. Accordingly, the invention should not be considered to be limited to 
the described embodiments, but defined only by the following claims, which are 
intended to embrace all equivalents thereof. As mentioned above, the parameter 
memory 4 is based on processes. Therefore, this kind of device can be provided 
even for computers handling other kinds of languages using processes (or tasks or 
threads) than functional languages, for example C and C++. Context switching is 
also provided in for instance the language ADA, and thus the featm^es described for 
the context switching could also be valid for other kinds of languages handling 
context switching. Data management of the kind described above is also made in for 
instance the modem language Java and is therefore valid also for such kinds of lan- 
guages. 
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L A processor architecture adapted to program languages operating with a sequen- 
tial flow of instructions and handling data through use of simple values and lists and 
dynamically allocated arrays, and comprising an instmction holding means (2,3), a 
data memory means (5) storing data objects, and execution means (7), characterized 
by 

means (4) for handling simple values and references to data objects in dependence 
of an actual instruction from the instruction holding means(2,3), said dependence 
being called a binding; 

means (6) to increment reference counts to a data object and to decrement reference 
counts to a data object in dependence of an actual instruction from the instmction 
holding means(2,3), and in dependence of the means (4), which handles simple val- 
ues and references, storing a reference to said data object. 

2. A processor architecture according to claim 1 characterized by 

means (4) for handling storage of simple data and references to data objects in the 
means (6), said stored data and references to data objects being referred to, by 
means of identifiers, from instructions from the instmction holding means(2,3); 
storage means (13) in the means (4) for handling storage of simple data and refer- 
ences to data objects. 

3. A processor architecture according to any of the preceding claims, characterized 
in that the means (4) for handling the storage of values comprises a parameter 
memory means (4) having means (10,11,12) for keeping notice of the bindings to 
the stored values, and having storage means (13) for storing said values. 

4. A processor architecture according to claim 3, characterized in that said storage 
means (13) in the means (4) for handling temporary storage of values stores values, 
which comprises at least type information and value information. 

AMENDED SHEET 
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5. A processor architecture according to claim 4, characterized in that the parameter 
memory (4) is used to transfer and store function arguments, such that argument 
values are stored in the parariieter memory (4) in dependence of an actual instmction 

5 from the instruction holding means (2,3), then bond to certain identifiers in the in- 
structions, and that said data values are arranged to be read from the parameter 
memory (4) in dependence of said identifiers in an actual instruction. 

6. A processor architecture according to any of the claims 3 to 5, characterized by 
10 means (1 1) in the parameter memory (4) for storing and managing scope informa- 
tion for the stored parameters, where the scope information determines which pa- 
rameters are currently valid and eligible to be read out from the storage (13). 

7. A processor architecture according to any of the claims 3 to 6, characterized by 
1 5 means ( 1 0- 1 3) in the parameter memory (4) for storing and managing information 

for scope and values, where the means (10) is used for storing and managing infor- 
mation for scope and data values, where the process information determines which 
scopes and values are currently valid and eligible to be read out from the storage 



8. A process architecture according to any of the claims 3 to 5, characterized by 

a process identification register (PROCESS 1; PROCESS 2) for identification of the 

currently executed process; 

a scope identification register (ENV 1, ENV 2) for identification of the currently 



9, A process architecture according to any of the claims 3 to 8, characterized in that 
at least the top of at least one priority queue of processes to be executed are kept 
available for reading, and that at least part of the process descriptor of the next pro- 




(13). 
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valid scope (fig. 4). 
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cess to be executed are kept available for reading. 
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10. A process architecture according to claim 9 characterized in that in order to 
make a process switch the means (4,5,6) for handling values and references: 

* creates a new scope and at least the program counter is stored in the parameter 
memory (4) using said new scope; 

* stores said new scope value in the process descriptor of the current process, said 
process descriptor may be stored in the data memory (5); 

* restores the scope value for the process switch from the process descriptor of said 
process; 

* sets the process switch to be the current process; 

* reads at least the program counter from the parameter memory (4) and performs 
the restoring. 

1 1 . A process architecture according to any of the preceding claims, characterized 
by instmctions having only one instruction format, where each instruction is com- 
posed of a distinct number of sub-instructions, each of which has in turn the same 
and only one format comprising a first part and a second part, the first part deter- 
mining the action to take and the second part providing a value to use in that action. 

12. A process architecture according to any of the preceding claims adapted to exe- 
cution of languages using functions and dynamic memory allocation characterized 
by a set of instmctions comprising 

dedicated instructions for making fiinction calls, fimction returns, parameter transfer 
between functions, 

13. A process architecture according to claim 11, characterized by a set of instruc- 
tions comprising dedicated instmctions for incrementing or decrementing the num- 
ber of references to data objects stored in the data memory (5). 



14. A process architecture according to any of the preceding claims, characterized in 
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that it is adapted to process parts of computer programs written in a functional lan- 
guage. 
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